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We derive the exact relationship, including all non-linearities, between real-space and redshift- 
space two-point statistics through the pairwise velocity distribution function. We show using numer- 
ical simulations that the pairwise velocity PDF is strongly non-Gaussian at all scales, and explain 
why this is so. We caution that a commonly used ansatz to model the redshift-space power spec- 
trum gives rise to an unphysical distribution of pairwise velocities, and show that it is in general 
impossible to derive the distribution from measurements of redshift-space clustering. Methods that 
claim to do this obtain instead something else, whose properties we derive. 

We provide a general derivation of the large-scale limit of the redshift-space power spectrum and 
show that it differs from the Kaiser formula by terms that depend on Gaussian and non-Gaussian 
contributions to the velocity dispersion of large-scale flows. We also show that the large-scale 
evolution of velocity fields is not well described by linear theory and discuss how this impacts the 
redshift-space power spectrum. Finally, we stress that using the monopole of the redshift-space 
power as an indicator of the real-space power spectrum shape can lead to systematic effects in the 
determination of cosmological parameters; nevertheless a simple procedure is able to recover the 
large-scale real-space power spectrum rather well. 



I. INTRODUCTION 

Redshift surveys provide a three-dimensional 
view of the large-scale structure of the universe. 
This view, however, is somewhat distorted due to 
gravitationally-induced pecuhar velocities that con- 
tribute to galaxy redshifts in addition to the smooth 
Hubble flow. These "redshift distortions" compU- 
cate the interpretation of galaxy clustering data 
from redshift surveys but, on the other hand, pro- 
vide a measure of the amount of dark matter in the 
universe (which sources peculiar velocities) due to 
the induced anisotropy of clustering statistics such 
as the power spectrum and the two-point correlation 
function. 

The two main signatures of peculiar velocities 
on the redshift-space clustering pattern have been 
known for a long time [1-4]). At large scales, galax- 
ies that fall into clusters look squashed along the line 
of sight in redshift space: infall velocities of galaxies 
between the cluster and us (between the cluster and 
the rest of the universe) add (substract) to the Hub- 
ble flow. This squashing effect leads to an increase of 
the clustering amplitude along the line of sight, thus 
the power spectrum is enhanced for waves parallel 
to the line of sight [3] . At small scales (compared to 



the size of virialized clusters) the internal velocity 
dispersion elongates clusters along the line of sight, 
leading to the so-called "flnger of god" effect. This 
suppresses the amplitude of waves parallel to the 
line of sight. Therefore, the Fourier space cluster- 
ing pattern shows a positive quadrupole anisotropy 
at large scales that gradually becomes smaller and 
eventually negative as small scales are probed [62]. 

This picture is captured by the "dispersion model" 
for the redshift-space power spectrum, 

where Pg{k) is the real-space galaxy power spec- 
trum, (3 fa r2°„^/&i where bi is the linear bias factor 
between galaxies and mass, /i = k^/k with z denot- 
ing the line of sight direction, and Cp the pairwise 
velocity dispersion assumed to be a constant inde- 
pendent of scale. Here /3 quantifies the squashing ef- 
fect, cTp the velocity dispersion effect. The particular 
form for the squashing effect is due to linear dynam- 
ics and linearized real-to- redshift space mapping [3] ; 
hereafter Kaiser limit); the velocity dispersion fac- 
tor is that corresponding to an exponential pairwise 
velocity distribution function with no mean stream- 
ing [5] . These effects factorize due to the implicit as- 



sumption that they can be treated as mdepcndent. 
Other dispersion models assume different dispersion 
factors (e.g. [6, 7]). 

The model in Eq. (1) is clearly oversimplified for 
a number of reasons, among them 

i) Even in the context of linear dynamics from 
Gaussian initial conditions the squashing fac- 
tor in Eq. (1) must be an approximation. In a 
random Gaussian field, the velocity field fluc- 
tuates from point to point, so there is velocity 
dispersion and thus the squashing effect must 
be necessarily accompanied by some sort of 
dispersion effect. This implies that these ef- 
fects are not independent. 

ii) The dispersion factor introduces a phenomeno- 

logical parameter which represents an ef- 
fective pairwise velocity dispersion, but whose 
value cannot be directly used to constrain 
models since in reality velocity dispersion is 
a function of scale and galaxy bias and is not 
clear how to relate it to the effective value cjp 
affecting the redshift-space power spectrum in 
this model. 

The first point implies that some of the dispersion 
effect may come from large-scale flows (as opposed to 
virial velocities) which can be modeled accurately in 
terms of the primordial power spectrum and cosmo- 
logical parameters. This is important because such 
improvement of the model can add significant con- 
straining power on theories. The second point also 
implies that there is potentially a lot to be gained 
from finding exactly how the redshift-space power 
spectrum depends on non-linear effects from veloci- 
ties and galaxy bias. Some attempts to do this using 
the halo model have been proposed [8-10], but they 
do not address the first point made above. In ad- 
dition, fitting formulae extracted from simulations 
that improve on the dispersion model have been de- 
veloped [11, 12], which although very valuable, they 
do not provide much insight into the problem. 

Despite its limitations, Eq. (1) has been a popular 
model for analyzing redshift surveys to obtain con- 
straints on cosmological parameters (e.g. [13, 14]). 
An alternative to using the dispersion model has 
been to simply ignore the dispersion effect, set- 
ting cTp = in Eq. (1), and argue that on "large 
enough" scales this is sufficiently accurate. Many 



results on cosmological parameters from measure- 
ments of the power spectrum rest on this assumption 
(e.g. [15-18]). Although in the past uncertainties 
from redshift surveys have been large enough that 
such strategies were reasonable, present datasets 
such as 2dFGRS and SDSS demand better accuracy; 
moreover, one expects to get more information than 
just one or two numbers from using the full depen- 
dence of the redshift-space power spectrum on scale 
and direction. 

In this paper we derive an exact formula for the 
redshift-space two-point function and power spec- 
trum in terms of the real space density and velocity 
fields, extending previous work along these lines [19]. 
We also show that this formula obeys a modified ver- 
sion of the "streaming model" , which was previously 
proposed in the small-scale [2] and large-scale [20] 
approximations. This gives a useful characteriza- 
tion of redshift distortions, since real and redshift 
space spectra are then related by the pairwise ve- 
locity probability distribution function (PDF), or 
its Fourier transform. The challenge is then how 
to model this PDF in terms of the linear power 
spectrum, cosmological parameters and galaxy bias. 
Some steps in this direction, modeling the first two 
moments, have been already given by [21, 22] us- 
ing the halo model, see also [23] for a modeling of 
the PDF inspired by perturbation theory. Recent 
work [24] provides a modeling of the pairwise PDF 
starting from that of halos. In addition, we show 
that the model in Eq. (1) leads to an unphysical 
distribution of pairwise velocities, and that infer- 
ring the pairwise velocity PDF from redshift-space 
clustering is unfortunately not possible in general. 
Methods that claim to do this [25-27] recover instead 
something else, whose properties we derive here. 

In this paper we mostly concentrate in the 
large-scale limit, showing using perturbation the- 
ory and N-body simulations that significant cor- 
rections to the redshift-space power spectrum in 
the Kaiser limit are expected at very large scales, 
k > 0.01 hMpc~^. In particular, we emphasize 
that the shape of the redshift-space power spectrum 
monopole is not a good approximation to the shape 
of the linear real-space power spectrum, even when 
fc < 0.1 hMpc~^. We find that weakly non-linear ef- 
fects tend to suppress monopole power increasingly 
with k, and more so for the quadrupole, supporting 
the argument discussed above that at least part of 
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the transition from positive to negative quadrupole 
with increasing k is due to large-scale effects, not just 
virial velocities. We briefly discuss the implications 
of these results for the determination of Qm and the 
reconstruction of the real-space power spectrum. 

Past work along these lines was done by [28-30], 
who considered whether deviations from the Kaiser 
limit at large scales could be due to large-scale ve- 
locities. However, these relied on the Zel'dovich 
approximation, which conserves momentum only to 
linear order, thus velocity fields are not described 
accurately enough to obtain reliable results (see 
e.g. [31]). Studies of the redshift-space power spec- 
trum using dark matter numerical simulations have 
shown significant deviations at large scales from the 
Kaiser limit before (e.g. [31, 32]), but these devi- 
ations have generally been blamed exclusively on 
virial velocities. In fact, as we discus here most of 
the large-scale velocity dispersion is due to weakly 
non-linear dynamics and thus has useful cosmologi- 
cal dependence on O^, cts and the shape of the linear 
power spectrum that can be used to enhance con- 
straints from galaxy clustering in redshift surveys. 

This paper is organized as follows. In section II we 
derive the exact relation between real and redshift 
space two-point statistics, obtain the pairwisc veloc- 
ity PDF in the dispersion model, and discuss the 
recovery of the pairwise velocity PDF from cluster- 
ing measurements. In section III we present the ex- 
act result for the redshift-space two-point correlation 
function in the case of Gaussian random fields and 
compare it to the Kaiser formula. We also present 
measurements of the pairwise velocity moments and 
discuss why Gaussianity is not a good approxima- 
tion even at large scales. In section IV we derive 
the large-scale limit of the redshift-space power spec- 
trum and discuss how it differs from the standard ap- 
proach in the literature. Section V presents results 
from perturbation theory and N-body simulations on 
the weakly nonlinear evolution of velocity fields at 
large scales and why it differs substantially from that 
of the density field. Finally, in section VI we present 
a simple model for the redshift-space power spec- 
trum based on the results of previous sections and 
discuss the recovery of the real-space power spec- 
trum. We summarize all the results in section VII. 
In paper II we present a calculation of the non- 
Gaussian terms in the evolution of pairwise velocities 
and their PDF. 



II. FROM REAL SPACE TO REDSHIFT 
SPACE 



Two-Point Statistics in Redshift Space and 
Pairwise Velocities 



In redshift-space, the observed radial position s of 
an object is given by its radial velocity, which reflects 
its true position due to the Hubble flow plus "distor- 
tions" due to peculiar velocities. The mapping from 
its real-space position x is given by: 



s = X - f u^{x)z, 



(2) 



where / = dlnf/dlna (with D the growth factor 
and a the scale factor) is a function of flm alone 
for open models or flat models with a cosmologi- 
cal constant [63], the scaled velocity field u(x) = 
—v{x)/{Ti.f), with v{x) the peculiar velocity field, 
the comoving Hubble scale, and we have as- 
sumed the "plane-parallel" approximation, so that 
the line of sight is taken as a fixed direction, de- 
noted by z. The density field in redshift space is 
obtained by imposing mass conservation, i.e. 



(1 -I- Ss) (fs =(1 + 6) d?x, 
and thus we have in Fourier space. 



(3) 



5T,{k)+5s{k) 



I 



(2^ 



-ik x ^ifk^UziX) 



l+d{x) 



(4) 

Note that this derivation is exact, it does not make 
any approximations about density or velocity fields; 
the only assumption is that we work in the plane 
parallel approximation, which is trivial to overcome 
by changing kzU^ {k ■ x) {u ■ x). Furthermore, 
since we are only using Eqs. (3-4), there is no refer- 
ence to the Jacobian of the transformation from x 
to s, Eq. (4) is valid even in regions where there is 
multistreaming. In other words, Eq. (4) is taking all 
mass elements at x and putting them at the corre- 
sponding s, if different x's give rise to the same s 
they will be summed over as necessary. 

For the power spectrum, Eq. (4) gives: 
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[l + 5{x)][l + 5{x')]), (5) 

where Auz = Uz{x) — Uz{x') and r = x — x' . In 
configuration space we have 

l + 6(s|h'S±) = J dri\ (^dD{s\\-ri\+ fAuz) 

\l + 5{x)] [l + 5{x')]), (6) 

where the constraint given by the delta function 
takes a pair separated by line-of-sight distance r|| = 
(x — x') ■ z in real space to S|| in rcdshift-spacc as 
given by Eq. (2), with perpendicular separations un- 
changed, s± = r±. Direct Fourier transformation of 
this equation yields Eq. (5) for the power spectrum. 
We can write Eq. (6) in a form closer to that of the 
power spectrum by rewriting the delta function, 

1+Usil,s±) = J ^e-*^('-ii-«ii) (e^^-^^^ 

[l + Six]] [1 + Six')]), (7) 

It is clear from Eqs. (5) and (7) that the basic 
object of interest is the line-of-siglit pairwise velocity 
generating function, A4{X,r), 

[l+^(r)]^(A,r) = (e^^"- [l + 6{x)] [l + 6{x')]), 

(8) 

where we are interested in A = ifk^ in Fourier space, 

or A = if J in configuration space. This generating 
function can be used to obtain the line-of-sight pair- 
wise velocity moments, e.g. 

give the mean and dispersion of the line-of-sight pair- 
wise velocities [64]. The pairwise velocity probabil- 
ity distribution function (PDF), 'P(w), is obtained 



from the moment generating function by inverse 
Fourier transform [65] , 

V{v,r) = f ^ e-^T" M{i^f,r). (11) 

Notice that V{v) depends on scale through the scale- 
dependence of A^, and indeed J dvV{v) v = f vi2{r), 
J dvV{v) = f^af^ir), etc. From Eq. (7) and (11) 
the redshift-space two-point correlation function can 
then be written as 

/oo 
dr\\ [l+CW] P(r|| -S||,r), 
-CO 

(12) 

where = rjj -|- and s± = r±. The physical 
interpretation of this formula is clear: V maps the 
pairs at separation r|[ to separation S|| due to rela- 
tive velocity — H(r|| — sy) [see Eq. (2)] with probabil- 
ity P{r\\ — siijT). This type of relationship between 
the real and rcdshift space correlation fimctions is 
known as the streaming model [2] , though it is com- 
monly written in terms of ^ rather than 1 + ^. If 
V did not depend on scale, both formulations arc 
equivalent, when there is scale dependence (as ex- 
pected in any realistic scenario), the first term in the 
integral for V does not give unity, thus one should 
use Eq. (12) instead. In fact, this contribution to 
has a simple physical interpretation: it corresponds 
to redshift-space density fluctuations generated by 
velocity fluctuations in a uniform (real-space) den- 
sity, i.e. when ^ = 0. li V did not depend on scale, 
random pairs are mapped into random pairs, scale 
dependence means that redshift-space correlations 
are created by taking random pairs in real space 
and mapping them to redshift space differently at 
different scales. 

The streaming model has been mostly used at 
small non-linear scales by assuming V to be an ex- 
ponential with zero streaming velocity and a scale- 
independent isotropic velocity dispersion [33]. At 
large scales, [20] showed that if one assumes the 
streaming model in phase space (with density and 
velocity fields coupled as in linear dynamics), it is 
possible to recover the Kaiser limit for the corre- 
lation function. We will stress in section IV, how- 
ever, that the large-scale limit uses an additional as- 
sumption, that S|| be much larger than the pairwise 
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velocity dispersion. Fisher [20] also claims that in 
the linear regime the relationship between and ^ 
can be reduced to the standard streaming model, 
i.e. as in Eq. (12) with 1 + ^'s replaced by ^'s [see 
his Eq. (26)]. This is incorrect, it suffices to see 
that if this were true all terms in would be pro- 
portional to ^, in particular, such a result does not 
admit redshift distortions generated by correlated 
velocity fluctuations (where V depends on r) in an 
unclustered distribution (^ = 0). 

The power spectrum and two-point correlation 
function in redshift space can be written in a similar 
form, 



Psik) = 



(27r)3 

drwdj 

— f 
27r 



Z(A,r)-l , (13) 



-i7(r||-S||) 



Z(A,r) - 1 



(14) 



where A — ifk^^if^ respectively and 

Z{\,r) = [l + i{r)] M{\,r). (15) 

It is important to note that the two-point corre- 
lation function is affected by redshift distortions for 
all configurations, even those perpendicular to the 
line of sight, since they are coming from different 
scales through the dependence of V on ry . It is how- 
ever possible to project out redshift distortions by 
integrating along the line of sight, 



Tl^ Jo 

2 



dsn ^s{s\\,rj_) 



r± Jo 
TT I P{k) 



kr I 



(16) 



which sets 7 = in Eq. (14). This is only true in 
the plane-parallel approximation, where the concept 
of "line of sight" is applicable. On the other hand, 
the redshift-space power spectrum has the nice prop- 
erty, in the plane-parallel approximation, that trans- 
verse modes are unaffected by redshift distortions (a 
wave in the k± direction is uniform in z and thus 
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FIG. 1: The parallel to the line of sight pairwise velocity 
PDF at redshift z = for pairs separated by distance r, 
measured in the N-body simulations. In the bottom left 
panel, the discontinuous at the origin PDF (thin solid 
line) corresponds to that given by the dispersion model, 
Eq. (19) (ignoring the delta function at the origin). In 
the bottom right panel, the narrow distribution (thin 
solid line) corresponds to the prediction of linear dy- 
namics, Eq. (44). 



unperturbed by the real-to-redshift space mapping) , 
therefore P^(fc^ = 0, /c_l) = ^'(fc_L)- 

Figure 1 shows the pairwise velocity distribution 
V for pairs separated by distance r along the line 
of sight, measured from the VLS simulation of the 
Virgo consortium [34]. This has 512^ dark mat- 
ter particles in a 479 Mpc box with a linear 
power spectrum corresponding to Q.m = 0.3 (includ- 
ing Jib — 0.04 in baryons), JIa = 0.7, h = 0.7 and 
(Tg — 0.9. Due to the large number of pairs (in 
our measurements we use 32 x 10^^ total pairs at 
scales between 0.1 and 300 Mpc h~^) and volume of 
the simulation, the statistical uncertainties are small 
enough that we do not plot error bars for clarity. On 
the other hand, one must keep in mind that neigh- 
boring points, separated by only 20 km/s, must be 
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highly correlated. 

Note that at most scales r ~ 2 — 100 Mpc h^^ 
the distribution is quite skewed (see also the central 
panel in Fig. 3 below for a plot of the skewness S3 as 
a function of scale). This arises as follows: the left 
tail (w < 0) corresponds members of pairs approach- 
ing each other as they fall into an overdensity, the 
right tail (v > 0) corresponds to members of pairs 
receding from each other as they empty underdense 
regions. Most pairs are not inside a void or falling 
coherently into a single structure, therefore the peak 
of V is close to w = 0. The asymmetry between the 
left and right tail gives rise to a mean infall {vi2 < 0), 
that is, it is more probable to find "coherent" pairs 
in overdense than underdense regions. 

Perhaps the most significant feature of V is that it 
has exponential wings at all scales, extending what 
was previously derived in the highly non-linear [35] 
and weakly non-linear [23] regime, the prediction 
of linear perturbation theory (shown as the thin 
solid line in the bottom right panel) is never a good 
approximation, not even in the large-scale limit. 
The reasons for this are discussed in detail in sec- 
tion IIIB. The thin solid lines in the left bottom 
panel show the results of the dispersion model, al- 
though by assumption it has exponential tails, it is 
a poor match to simulations (even though Cp is fit- 
ted to the measured redshift-space power spectrum) 
and represents an unphysical (discontinuous and sin- 
gular) distribution of pairwise velocities. See next 
section for details. 

Figure 2 shows V for pairs separated by distance r 
perpendicular the line of sight. In this case we define 

v± = + Vy, then if V is the PDF for a perpen- 
dicular component of the velocity field (i.e. Vx or Vy, 
it's the same by isotropy and even by symmetry) it 
follows that 

(17) 

For a Gaussian distribution V{vx) = 
(27ra2)-i/2e-"x/2'T^ and thus V{vj^) 
(wj^/fj^) e~"-L/^°' . In this case V has zero skewness, 
by symmetry all odd moments vanish. Apart from 
this, the behavior of V is similar to the parallel 
case, the distribution is non- Gaussian at all scales 




[km/s] [km/s] 

FIG. 2: Same as Fig. 1 but for pairs perpendicular to the 
line of sight. The thin solid lines in the bottom panels 
are as in Fig. 1, the prediction of the dispersion model 
(left) and linear dynamics (right). 



and displays exponential tails. We now turn to a 
discussion of V in the dispersion model. 

B. The Dispersion Model 

It is instructive to recast Eq. (1) in terms of the 
full pairwise velocity distribution that it implies. 
There are two contributions to the pairwise PDF 
in this model, one given by the squashing factor, the 
other by the dispersion factor, with the total PDF 
being the convolution of both PDF's. The Fourier 
transform of the dispersion factor in Eq. (1) corre- 
sponds to a pairwise velocity PDF that is exponen- 
tial, that is 

g-|t)|\/2/(Tp 

Pdisp(«) = y= , (18) 

VZap 

The squashing factor in the Kaiser limit corre- 
sponds to a delta function PDF, see Eq. (60) be- 
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low for a derivation. Performing the convolution of 
this with Eq. (18) leads to the pairwise PDF in the 
dispersion model, 



for modes parallel to the line of sight is not the real- 
space power spectrum multiplied by the generating 
function M.. Instead, from Eq. (13) we get 



r{v) = 



-\v\V2/crp 



72 , 2/^V>, 
1 T fvvi H 2~ 

x[l- V2ap dD(f)]l, (19) 



where the + sign corresponds to w > 0, and the — 
sign to w < (opposite for "t)-velocities" shown in 
Fig. 1), and yj-u denotes the velocity- velocity corre- 
lation function in linear dynamics, = + i^A'^, 
see Eqs. (33) and (38) for explicit expressions. Note 
that the resulting PDF is singular at the origin, and 
in addition has a jump discontinuity at t; = which 
is proportional to Wi2. The bottom left panel in 
Figs. 1 and 2 illustrate this result (omitting the sin- 
gular term at w = 0) and compares it to the mea- 
surements in numerical simulations for a separation 
of r = 10 Mpc/ft.. We have fitted the value of CTj,, 
as it is normally done, to the measured quadrupole 
to monopolc ratio of the redshift-space power spec- 
trum. Despite this fit to the power spectrum, the 
resulting PDF does not fit the simulation results. 
This is hardly surprising, since the dispersion model 
Eq. (1) makes unphysical predictions for the pairwise 
velocity PDF, see Eq. (19). 



C. Recovery of the Pairwise Velocity PDF 
from Redshift-Space Two-Point Statistics 

Given the relationship between the redshift-space 

and real-space correlation function through the pair- 
wise velocity PDF, Eq. (12), it is natural to ask 
whether one can recover information about the PDF 
from clustering measurements. The problem is that 
there is no single PDF involved in Eq. (12), but 
rather an infinite number of PDF's corresponding 
to different scales and angles of the velocities with 
respect to the line joining the pair. If there was no 
scale dependence and anisotropy, all the PDF's are 
the same and Eq. (12) becomes a convolution, thus 
one can find the PDF by deconvolution. In other 
words, due to the scale dependence of the pairwise 
velocity PDF, Eq. (12) is not really a convolution; 
this implies that the redshift-space power spectrum 



n(fc) = Pik)+Miifk,,k) 

+ J d^qM{ifk,,k-q) P{q), (20) 

where [recall that A^(i/A:^, r) = / T'{v,r)e'^^'''dv] 
is basically the double- Fourier transform of V{v,r), 



M{X,p) 



1+e 



X e 



-ipr 



(2^ 



(21) 



except that we substract the zero mode A^(0, r) = 1, 
thus A4{0,p) = 0. For example, in the Kaiser limit 
we have 



M{ifk,,p)^2f 



kzP. 
P 



2 Pseip) + Peeip), 
1^ pi 



(22) 

where Pgg denotes the density-velocity divergence 
power spectrum and Pgg is the velocity divergence 
power spectrum. In linear PT, P = Pss = P50 = 
Pes, but we will keep the distinction because weakly 
non-linear corrections are significant at large scales, 
see section V. 

If we assume that all pairwise moments have 
no scale dependence and are isotropic, which im- 
plies that odd moments vanish (since they must be 
anisotropic, by symmetry odd moments vanish when 
r • z - 0), M{ifk,,p) = [M{ifk,) - 1]5d{p) and 
thus Ps{kz) = P{k) M{ifk). Note that in this 
case M{ifk) is real because odd moments vanish, 
however in general M.{ifk) is complex. By taking 
(even number of) derivatives with respect to A of 
Ps{\/ —X"^ z) / P{\/ —X^) = M{X) one can generate all 
(even) moments and thus find the (symmetric by as- 
sumption) PDF by inverse Fourier transform. 

Galaxy redshift surveys show that P, (kz) / P{k) is 
very close to a Lorentzian, and this has been inter- 
preted as evidence for an exponential pairwise veloc- 
ity PDF [25, 26]. However, realistically one cannot 



7 



neglect anisotropy, since wc know that odd moments 
must be non-zero, in particular there are infall ve- 
locities (fi2 5^ 0) and skewness. The infall velocities 
are small compared to the dispersion at small, non- 
linear scales, however the skewness is expected to be 
significant except in the highly non-linear regime, 
see Fig. 3 below [23, 36-38]. By construction, since 
power spectra are functions of only fc^ and A:^ by 
statistical isotropy, using Ps{kz)/P{k) as a gener- 
ating function (with k = —iX) only generates even 
moments and thus a symmetric answer for its PDF, 
even if the actual PDF is asymmetric [66] . 

To see what this method actually recovers we 

go back to Eq. (20) and write explicitly that the 
redshift-space power spectrum depends on the mag- 
nitude of k and kz, 

Ps{k,kz) = P{k) + ^[M{ifkz,k) + M{-ifkz,-k)] 
+ j \[M{ifkz,k-q) 
+ M{-ifkz,-k-q)] P{q), (23) 

that is, we average together waves with opposite 
wavevectors. In this method the moment generat- 
ing function is identified with the ratio Ps{kz)/ P(k) 
(where we must replace k by —iX in our convention), 
thus it becomes 



(25) 



+ 6z^ al^(r)-\z^ v^^{t)\ [1 + ^(0], 

(26) 

where f = J d^r^(r) is the average of the cor- 
relation function, and the integrals over the volume 
V are cutoff at some large scale (depending on the 
size of the survey and the practical implementation 
of the method). In Eqs. (25-26) to^^'' denotes the 
third moment of the actual PDF and 177.^2^ its fourth 
moment. It is clear from these equations that the 
moments of this pseudo-PDF are weighted versions 
of combinations of several moments of the true PDF, 
so their value is not straightforward to interpret. 
From Eq. (25) we see, for example, that the effec- 
tive value of the velocity dispersion a'^g picked up 
by this method is given by 



d-^r 
(2^ 
[A^(A,r) - l]e-^^ 



[i + ?W] 

[M{-X, r) - l]e^^ 



2P(v^) 



(24) 



It is easy to check that if Al (A) does not depend 
on r, Q{X) = M{X) and thus one can recover by in- 
verse Fourier transform of ^(A) the PDF of pairwise 
velocities, as discussed above. However, in the real- 
istic case with scale dependence and anisotropy, the 
inverse Fourier transform of Q{X) does not give the 
actual PDF. To illustrate this, let us calculate the 
first two non- vanishing moments (second and fourth) 
of this symmetric "pseudo-PDF" , 



^eff 



2 12 
-cr^(r)-h-<TM(r)--r {ii2(r) 



(27) 

where we used that by symmetry ct^2 (^) = ('')+ 
i^^[o-y(r) — cr^(r)] and Vi2{r) = v V\2{t), with v = 
z/r. Therefore, we see that the effective value of 
the velocity dispersion is a weighted version of the 
underlying velocity dispersion minus a contribution 
due to the mean streaming (recall in our convention 
here Vi2{r) > 0, these are "u- velocities" ) , therefore 
one expects this method to yield a biased low value 
of the weighted [by r^(l + mean of the disper- 
sion if not corrected for infall, as stressed by [39] 
and more recently in [27]. Note however that [27] 
also interpret the pseudo PDF as the actual PDF of 
pairwise velocities, and they do not include skewness 
into their treatment. 
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D. The pairwise velocity PDF in terms of its 
building blocks 

We now discuss how to use cuniulant expansions 
to evaluate the pairwise moment generating function 
or Z(X, r) in Eq. (15) in terms of its building blocks, 
the cumulants. The starting point is the property 
between the moment and cumulant C(j) gen- 

erating functions for a set of fields which we group 
together in a vector field A, 

MO) = (e-i-^) = exp (e-i'^)^ = expC( j), (28) 

where A = {Ai,...,An} and similarly for j. 
Derivatives of C generate all the connected corre- 
lation functions. By taking derivatives with respect 
to appropriate components of the vector j, it follows 
in particular that 

(e^'i^Ma) = (e''i^M2)^exp(e'i^^)^, (29) 

(e-'"^^M3)Jexp(e^"^^0e- (30) 

Using ji = X, Ai = Auz, A2 = 5{x) = 6 and 
A3 = 6{x') = 5', this leads to the exact expression, 



It is straightforward to evaluate Eq. (31) in the 
linear regime, for Gaussian fluctuations. In this case, 
the velocity is proportional to the density, whose 
only non-zero cumulant is the second and thus 



AuJ' 



\2 

X expy(Au2) 



(32) 



Notice that even in this case, the resulting ex- 
pression is non-linear in the amplitude of correlation 
functions and does not involve terms of the same or- 
der in linear perturbation theory. Even though fluc- 
tuations are assumed to obey the linear dynamics, 
the non-linear nature of the redshift-space mapping 
leads to a somewhat more complicated picture. We 
will explicitly evaluate Eq. (32) in the next section. 
Non-linear effects due to dynamics lead to significant 
deviations from the predictions of Eq. (32) , even at 
large scales, we discuss this below. An evaluation of 
Eq. (31) is given in paper II. 



III. THE REDSHIFT-SPACE POWER 

SPECTRUM AND CORRELATION 
FUNCTION IN LINEAR DYNAMICS 



Z{X,r) = exp(e^^"^)^ 



1 



.XAU, 



+ e 



(31) 



Note that the overall factor in this expression 
is the moment generating function for the line of 
sight velocity differences, and it is a volume weighted 
quantity (as opposed to the pairwise velocity PDF 
which is mass weighted, by densities at x and 
x'). This velocity-difference PDF is not sensitive 
to galaxy biasing, since it does not depend on the 
density field and even if there is velocity bias inside 
dark matter halos this is a small effect [40-42] and 
halos are in addition suppressed by volume weight- 
ing due to their small size. Therefore, the velocity- 
difference PDF depends on weakly nonlinear dynam- 
ics and thus can be modeled (almost) exclusively in 
terms of cosmological parameters. 



A. Pairwise velocity moments 

We now give an explicit evaluation of Eq. (32). 

Using symmetry considerations, the velocity corre- 
lation function can be written as 



{ui{x + r/2) Uj{x-r/2)) = V_l('') % 

+ [V.||(r)-V±(r)] ^, 
(33) 

where tp^ (r) and tp±{r) are the velocity correlation 
functions parallel and perpendicular to the line of 
sight, respectively. They are related to the velocity 
divergence power spectrum P00{k) through [43] 



Mr) = / ^ ^ m 
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Peejk) 

fc2 



[io{kr) 



(35) 



where ji{x) is the usual spherical Bessel function, 
and we assumed a potential flow, which implies 
tpwir) = d{rtpj_)/dr) and iljj_{r) > i^\\{r). The vari- 
ance of velocity differences reads, 



(Aul) = 2(a', - ^^(r) + ^ A^(r)) , (36) 

which leads to the volume-weighted velocity differ- 
ence moment generating function 



Zo{X,r) = expy(Au^) 



= exp (al - V± (r) + ^ AV'(r)) , 



(37) 



where 



AV'(r) = ^p±{r) - V||(r) = j 



(38) 

and the one-dimensional linear velocity dispersion 
is given by 



'HI 



(39) 



Note that as r ^ 0, ilJ\\{r) = 'ip±ii') = and 
then Zq{X,0) = 1, as expected from its definition. 
On the other hand, as r — > oo, tp\\{r), V'-l(^) ^ 0, 
and then Zo(A, oo) « exp— A^cr^. To evaluate the 
prefactors in Eq. (32), we use that [see Eq. (9)] 



AuM=^)+Hx')]) = v^r) [l+e(r)] 

(40) 

and 



AuJix)) = [AuJix')) = - v^2{r) [1 + C(r)], 

(41) 



then 



Zg = + m ['^ + >^vi2{r) + -vUr)[l+^{r)] 



X exp 



A^(a^-V±(r) + ^ AV^(r)). 



(42) 



B. The Failure of Gaussianity 

It is important to note that, although the large- 
scale limit of Vi2 is well described by linear dy- 
namics (see e.g. [21, 44]) the same is not true for 
the pairwise dispersion, indeed we have [6 = 5{x), 
5' = S{x')] 

^^12(1 + = {Au,^{l + S)il + S')) 

= (Atx.2)(l + ^) + (Au/((5 + (5')) 
+ {Au,^SS')c. (43) 

In linear dynamics, Gaussianity implies that the 
last two terms vanish; however, in reality the third 
moment term contributes a constant in the large- 
scale hmit (r = |a; — a;'| ^ oo) that adds in quadra- 
ture to the contribution of the first term (we eval- 
uate this term in paper II). Therefore, linear the- 
ory never gives a good approximation to the second 
moment of pairwise velocities. That there are non- 
Gaussian corrections should be of no surprise since 
pair weighting means the second moment of pair- 
wise velocities involves up to fourth moments [23], 
the interesting aspect here is that even in the large- 
scale limit non-Gaussian terms persist, e.g. (uz'^S) 
contributes a constant at large scales. 

The top panel in Fig. 3 illustrates this point, where 
(Ti2 is shown as a function of scale for the N-body 
measurements (square symbols) and linear theory 
(dashed lines). All quantities in this figure refer to 
velocity components parallel to the separation vector 
of the pair. It is also important to note that the de- 
pendence of fTi2 on scale is opposite in the linear case 
(decreasing at smaller scales) than in the simulations 
(though at scales r < 1 Mpc/h, ai2 starts decreasing 
in the N-body results). This is also a feature of the 
Gaussian restriction of linear dynamics, as we shall 
discuss in paper II, and it implies that the disper- 
sion effect on the two-point correlation function or 
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FIG. 3: Moments of pairwise velocities parallel to the 
line conecting the pair as a function of scale. Top panel: 
pairwise dispersion cti2 (squares) as a function of scale, 
its connected piece (solid line), and the mean in- 
fall vi2 (triangles). The dashed line denotes the pre- 
dicted ai2 in linear dynamics. Middle panel: dimension- 
less measure of infall (|wi2|/o"i'^2) triangles) compared to 
the skewness of the pairwise velocity PDF (squares); the 
skewness dominates at most scales. Bottom panel: kur- 
tosis as a function scale, note that it does not vanish at 
large scales and S4 > 1 at all scales; the pairwise velocity 
PDF is strongly non-Gaussian at all scales, see Fig. 1. 
For reference, an exponential distribution has S3 = and 
S4 — 3. 



power spectrum will be significantly underestimated. 
Physically, in the Gaussian case as r is decreased the 
velocity field is more correlated and thus (Auj^) de- 
creases; since no correlations between density and 
velocity squared are incorporated in linear theory, it 
is impossible to see that the velocity of pairs in re- 
gions of larger overdensity are fluctuating more; this 
is described by the non-Gaussian third and fourth 
terms in Eq. (43). 

The other panels in Fig. 3 show how important 
non-Gaussianity of the pairwise PDF is. The cen- 
tral panel compares the skewness S3 to the dimen- 



sionless measure of infall, Wi2/cr5^2i where is the 
connected second moment shown by solid lines in 
the top panel. This shows that the skewness is 
more important than infall at most scales (and by 
a large factor at scales where infall is most impor- 
tant). Therefore, modeling the pairwise PDF with 
infall but no skewness, as it is often done (see [20] 
for an exception), is not a good approximation. Fi- 
nally, the bottom panel shows the kurtosis S4 as a 
function of scale, this quantifies that the pairwise 
PDF is strongly non-Gaussian {34 > 1) at all scales, 
and it is basically a manifestation of the exponential 
wings seen in Figs. 1 and 2 at all separations. 

Why is the Gaussian limit of the pairwise veloc- 
ity PDF never reached at large separations? The 
reason is that the relevant quantity is the (density- 
weighted) difference in velocities. At a given separa- 
tion r the velocity difference does not receive contri- 
butions from modes with wavelengths much larger 
than r, since those give the same velocity to x and 
x' . For wavelengths smaller than r the contribution 
of modes is down- weighted only by k^^ (indepen- 
dent of r in the r — > cxd limit); therefore even at 
large separations one is sensitive to nonlinearities. In 
other words, at large separations the velocities are 
uncorrelated and thus the pairwise velocity gener- 
ating function factorizes into individual particle ve- 
locity generating functions. These are sensitive to 
non-linearities, i.e. there is no "large scale" in that 
problem. Thinking in terms of the halo model, at 
large scales the pairwise dispersion is due to parti- 
cles in different halos, each of which has its own (in- 
dependent) one-point dispersion due to virial ("non- 
linear") and halo ("linear") motions, these contribu- 
tions will add in quadrature to give the full disper- 
sion (see [21]). We caution, however, that this split 
is not straightforward, halo motions are not well de- 
scribed by linear theory (their pairwise PDF in the 
large-scale limit is not exactly Gaussian, see [45]). 

In [23] it is argued that exponential tails in the 
pairwise PDF are generated by pair weighting; al- 
though this is in part important, it is not the whole 
story. We show in paper II that the velocity differ- 
ence PDF (which is volume weighted) also has expo- 
nential tails in the large-scale limit, for the reasons 
discussed above. Of course, pair weighting helps 
build non-Gaussianity and it is responsible for the 
deviations in (T12 from linear theory at large scales. 
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C. The Exact Result for Gaussian Random 
Fields 

Even though Gaussianity is not a good approxi- 
mation to describe the statistics of pairwisc veloc- 
ities, it is instructive to discuss the redshift-space 
correlation function in the Gaussian case, both as a 
starting point for more accurate calculations and to 
discuss the regime of validity of the Kaiser limit. 



The only assumptions in deriving Eq. (42) are that 
fluctuations are Gaussian and velocity flows are po- 
tential, i.e. there is no assumption about the am- 
plitude of fluctuations (in practice, of course, Gaus- 
sianity follows only if fluctuations arc vanishingly 
small). It is easy to write down explicitly the pair- 
wise velocity PDF obtained from using Eq. (42) in 
Eq. (11), 



J 



r{v) 



27r/(Awi>- 



cxp 



V Vi2 



/(A?i2) 4V/2(A7 

I 



^12 



A«2 



(1+0 



(44) 



This is not a Gaussian distribution (except when 
Vi2 = at large or small scales, or at all scales for 
separations perpendicular to the line of sight), al- 
though close to its peak it is well approximated by a 
Gaussian centered a,t v = fv\2- Note however that 
the velocity difference PDF is Gaussian, being the 
prcfactor outside the square brackets. The second 
and higher cumulants of the pairwise velocity PDF 
are e.g. 



l + X Ui2 



where 



(x2 



Ul2 (1 



Ul2 



Vl2 



■0 



(46) 



2 , 2 



(47) 



{v\ = -^(l-6^ + a(^^)^ 



(45) 



where (v) = /ui2, and = p{Au1) is the 
variance of the distribution of velocity differences. 
The non-Gaussianity is induced solely by the non- 
linearities in the mapping from real to rcdshift 
space. The two-point function can be written us- 
ing Eq. (12), 



1 + 6(S|hS-L) 



dr\\ 



e 



This is the exact result in the Gaussian limit, and 
has been obtained before by Fisher ([20], Eq. 20) by 
integrating the four dimensional joint Gaussian PDF 
for 5{x), S{x'), Uz{x) and Uz{x'). See also [46, 47]. 
The method described in section II D is an alterna- 
tive way of obtaining the same result with consider- 
ably less algebra, and the advantage that also holds 
in the non-Gaussian case provided the correlators in 
Eq. (31) can be calculated. The analogous result for 
the power spectrum is {y = z/r, k± = k^/l — /x^, 
r_L = rVl — J^^), 



P«(fc,M) 



if'"*/' 



du Jo[k±r±) cos{krfj,u) [.2q'^"(A, r, ;/) — 1] +sm{kriJ,i>) 



r,u) , (48) 
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which involves a 2D rather than ID integration. 
Here Z^'^{X,r, i/) corresponds to the term propor- 
tional to A in Eq. (42), and ^^"'"(A, r, ly) is the rest. 
In order to obtain power spectrum multipoles it is 
sometimes more convenient to calculate first multi- 
poles of the correlation function, 

Ur) = J'^ dv Ur, ^) LeH, (49) 

where Li denote the Legendre polynomials, and then 
using the plane-wave expansion (/i = k^/k, v = z/r) 



^-rk r ^ ^(_,)^(2£+ 1) j^(kr) L,{^i) (50) 

£=0 

obtain from them the power spectrum multipoles, 

Piik) - ^ dr r'Mkr) ^r)- (51) 

In this way, a 3D numerical integration gives both 
the redshift-space correlation function and power 
spectrum. 

Figure 4 shows the result for the redshift-space 
correlation function ^s(s||, s_l) in the exact Gaussian 
case Eq. (46) (solid) and the Kaiser limit (dashed), 
Eq. (61) below. Notice that there are significant 
deviations even at large scales, predominantly at 
small sj_, we explain why this is the case in sec- 
tion IV below. Since the correlation amplitude is 
so much smaller in this region compared to small 
S||, when multipoles are calculated integrating along 
fixed s the results are close to their Kaiser limit 
values. It is apparent that the qualitative behav- 
ior of the corrections are to make the contours less 
squashed, as expected from the effects of the veloc- 
ity dispersion. This is evident in Fig. 5 which zooms 
into small scales, the dispersion effect is obvious 
(the quadrupole has opposite sign from that at large 
scales). Note that this happens at very small scales 
because the pairwise dispersion decreases at small 
scales (see dashed line in Fig. 3), thus one needs to 
go to tiny scales before sy becomes smaller than the 
pairwise dispersion. In addition, one can see that 
the Gaussian result is not close to the Kaiser limit 
even when the amplitude of the correlation function 




[Mpc/h] 

FIG. 4: Contours of ^3(s||,Si) for the Exact Gaussian 
result (solid) and the Kaiser limit (dashed), for a fiat 
ACDM cosmological model {flm = 0.26, erg = 0.9, fib ~ 
0.04, h = 0.7) with linear bias 6i = 1. 



is much smaller than unity, e.g. see ~ 0.002 in 
the left panel in Fig. 4. 



As shown in Fig. 3 and discussed above, assuming 
Gaussianity is not a good approximation, therefore 
these results are not a substantial improvement over 
the Kaiser limit, and we do not show corresponding 
results for the power spectrum. What is interest- 
ing here is that it gives some idea of how to incor- 
porate the effects of large-scale velocity dispersion; 
we will come back later to this when we develop 
a simple approximation to the redshift-space power 
spectrum. We now turn into a discussion of the as- 
sumptions behind the Kaiser limit, and show how 
our approach differs from the standard derivations 
of it in the literature. 
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FIG. 5: Same as Fig. 4 but at small scales. Note the 
dispersion effect: a Gaussian random field does show 
"fingers of god" even though the velocity dispersion de- 
creases monotonically towards small scales (see dashed 
line in Fig. 3). 



IV. THE LARGE-SCALE LIMIT 
A. Derivation 

The non-trivial part of Eq. (46), and Eq. (12) in 
general, is that as one integrates along rj| one is inte- 
grating over a different PDF due to scale dependence 
and anisotropy. The relationship between and ^, 
Vi2 and (7^2 {{'^ul) in linear dynamics) is non-local, 
however at large scales one can express S,s in terms 
of local second moments by the following procedure, 
which leads to a derivation of the Kaiser formula and 
makes clear its regime of validity. 

What do we exactly mean by "large scales"? Al- 
though ^ and Vi2 vanish in the large-scale limit, 
does not (and (Au^) is largest at large scales), there- 
fore we are not allowed to do a small amplitude 
expansion in this case. On the other hand, when 
S|| ^ /o'i2 the integration over ry will be sharply 
peaked about rn = sn , thus we can "expand real 



space about redshift space" , 



V{v;r«) w V{v;sn) + {rn - sn) 



dV{v;sn) 



(52) 



Note that since this expansion can be done for 
any PDF (not just the one corresponding to linear 
dynamics), we will do so in general, our results here 
apply to the fully non-linear case. Similarly one can 
expand ^(r) « ^(s) -I- . . ., and using that v = r\\ — 
[see Eq. (12)], keeping up to second derivatives we 
obtain 



1+6 « (i+0(i + M2 + Y^?2" + 



(53) 



where all quantities in the right hand side are eval- 
uated at s and derivatives are with respect to sy, 
e.g. v'i2 = dvi2{s) / ds\\. Keeping only terms linear 
in quantities that vanish in the large-scale limit gives 



^^"^?2U, (54) 



where cr^2loo is the large-scale limit of the pairwise 
dispersion. In Fourier space this reads, 



Ps{k) « P55{k)(l 



-f'^Prr'^ I 
2^ '^z"l2\oD 



ifkz vi2{k) 
(55) 



Higher-order derivatives are suppressed by higher 
powers of k^. Expanding real space about red- 
shift space should work well when the derivatives 
in Eq. (54) are small (fc^ is small), i.e. when con- 
sidering waves with k with a small component with 
respect to the line of sight in which case the distor- 
tions are small. The large scale limit of vi2 is given 
by linear theory, Wi2(fe) = —2ikzPse{k)/k'^, whereas 
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for (7^2 both Gaussian and non-Gaussian terms con- 
tribute. We calculate the non-Gaussian terms in pa- 
per II, for our purposes here let us just write erf 2 \oo = 
2{<jI + A„) and al^ik) = -2PMk)/k^ + B,{k), 
then we have 

Ps{k) = Ps5{k) (1 - fk^ia^ + A,)) + 2/ ^ Pseik) 
+f^^Pee{k)-\pklB,{k), (56) 

where the non-Gaussian terms correspond to = 
{ul5) and = FT(Aii2(5 + 5' + M'))c, where FT 
stands for Fourier transform. We show in paper 
II that in the large-scale limit, B^ « (8/35)(4 + 
ll/i^/3)cr^P(fc). A„ corresponds to the differ- 
ence in the large scale limit of 17^2 to the linear 
value (squares compared to dashed lines in Fig. 3), 
whereas B^j is the non-Gaussian contribution that 
takes into account that the scale dependence of the 
pairwise velocities is opposite to that in linear the- 
ory, i.e. increasing toward smaller scales, as a result 
it counters the effect of the Pee term. Prom Eq. (56) 
it follows that when 

klfal-^l, or fc/x<0.2/iMpc-\ (57) 

where we assumed a flat ACDM model, for which 
al w 40 (Mpc h-'^f and / « 0.5 at ^ = 0, one re- 
covers the Kaiser formula [3] for the power spectrum 
(the reason why we don't assume Pss — Pse — Pee 
will become clear in section V), 

Ps{k) = Pss{k) + 2fi?Pse{k) + ffi^Peeik). (58) 

The condition in Eq. (57) says that, unless one 
considers modes nearly perpendicular to the line 
of sight /i ~ 0, velocity dispersion effects become 
important for wavenumbers much smaller than the 
non-linear scale. Note that at kz ~ 0.2 h Mpc~^ 
the velocity dispersion terms become of order unity 
almost reversing the enhancement of the rcdshift- 
space power spectrum. These additional terms have 
important dependencies on cosmological parameters 
that are different from those in the Kaiser formula, 
for example A^- ~ 61 al and B^, ~ bia^ in the large 
scale limit, where 61 is the linear bias, with as 



depending also on the shape of the power spectrum. 
This can help break degeneracies present in Eq. (58). 

Note that although Eq. (58) has the right limit at 
kz — 0, giving the real-space power spectrum, the 
second derivative (which is the first non-vanishing) 
with respect to kz does not (except at fc = 0), as 
this is sensitive to velocity dispersion effects, both 
Gaussian and non-Gaussian. It is useful to recast 
Eq. (58) in terms of what it implies for the pairwise 
velocity PDF. To do this, we can expand Eq. (42) 
for small A (recall A = ifkz in Fourier space), 

^G«l + ^(r)+At;i2(r) + y (Aw^^. (59) 

This implies that the pairwise velocity PDF in the 
Kaiser limit has the form [see Eq. (11)], 

that is, it corresponds to a very sharply peaked PDF, 
since the dispersion (Aul) is effectively assumed 
to be vanishingly small. This is the result used in 
Eq. (19) to derive the pairwise PDF in the disper- 
sion model, and when put into Eq. (12) gives the 
two-point function [20, 48]) 

^.(.||,^.)=e(.) + /^^i2(.) + ^^(A..^). 

(61) 

It is interesting to go back to Fig. 4 and compare 
the exact result for Gaussian random fields to the 
Kaiser formula. The expansion in Eq. (52) is best 
when the scale dependence of the PDF is small. This 
is going to be less safe for smaller s± , since for large 
s_L variations in ry as one integrates enter only in 
quadrature in = r"^^ + s^, whereas for s_l ~ 
variations in ry enter linearly into s. This is the 
analogous situation to having kz not small in Fourier 
space, and this is why the largest deviations in Fig. 4 
happen near sj_ = 0, even at large scales. 

Finally, a few words of caution about the expan- 
sion in Eq. (52). This converts integration over an 
infinite number of PDF's into a single one and its 
derivatives, thus significantly simplifying the cal- 
culation. Note however than in order to arrive 
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to Eq. (54) one must interchange the order of the 
derivatives and integrals over the PDF and integrate 
term by term. Such a procedure is not strictly valid, 
since it is very likely that the expansion in Eq. (52) 
does not converge uniformly. Indeed, in the Gaus- 
sian case one is expanding Eq. (42) for small A, and 
the exponential series has zero radius of convergence, 
thus term by term integration is not mathematically 
valid. Note also that at the end, terms that were 
supposed to be of increasing order in a small param- 
eter in Eq. (52) end up being of the same order of 
magnitude in Eq. (58). 

B. Comparison with the standard derivation 

Let us now compare our derivation of the Kaiser 
limit with the standard approach ([3, 32, 49]) which 
makes explicit use of the Jacobian J = \dsi/dxj\ 
of the mapping from real to redshift space. In the 
plane-parallel approximation, J{x) = |1 — /VzMzI 
and from Eq. (3) it follows that 1 + 5s{s) = [1 -|- 
5{x)\/ J{x). Now if we assume fV zUz ^ li we can 
expand 1/J{x) ~ 1 + fVzUz, and thus linearizing in 
the field amplitudes if follows, 

ds{s)c^d{x) + fVzUz{x), (62) 

which, using that V • u = (5 in linear dynamics, and 
s ~ a; to leading order, in Fourier space leads to 

^,(fe) = ^(fe)(l + //x2). 

There arc several steps in this derivation which are 
unjustified, namely, the density and velocity gradi- 
ents at a given point in space are not small (i.e. for 
CDM models their linear variance at a point is much 
larger than imity), note that there is no smoothing 
involved until after one makes these approximations. 
In particular 5{x) can be large inside dark matter 
halos and similarly VzUz, which will also fluctuate 
in sign. What is small is the correlation between 
fields separated by large distances, not the field am- 
plitudes themselves. By making approximations at 
the level of density and velocity fields one gets in- 
correct correlations, in the sense that the velocity 
dispersion of a Gaussian random field never appears 
in this approach. The derivation presented in sec- 
tion II A and IV A shows that it is unnecessary to 
assume anything about the Jacobian of the trans- 



formation or the amplitude of density and velocity 
gradients. 

V. NON-LINEAR EVOLUTION OF 
DENSITY AND VELOCITY FIELDS 

The expansion leading to Eq. (56) has little to 
do with nonlinear dynamics (only involved in gen- 
erating the non-Gaussian terms), but rather with 
the nonlinearities of the real to redshift-space map- 
ping. We now explore the corrections induced in the 
redshift-space power spectrum due to non-linear evo- 
lution of the density and velocity fields. We shall see 
that the velocity field is affected more significantly 
than the density field at large scales due to larger 
sensitivity to tidal gravitational fields. 

We are interested in calculating the non-linear 
evolution of density and velocity divergence auto and 
cross spectra and comparing to numerical simula- 
tions. Measuring the volume-weighted velocity di- 
vergence power spectrum in numerical simulations 
is not straightforward at small scales. Interpolating 
the particles velocities to a grid gives the momentum 
(density-weighted velocities); in order to obtain 9{k) 
one possibility would be to 

i) Fourier transform the momentum, and di- 
vide the Fourier coefficients by the interpola- 
tion window ("sharpening" of the momentum 
Fourier coefficients). 

ii) do the same for the density field, and then 
transform back to real space density and mo- 
mentum fields. 

iii) divide momentum by density at each grid 
point. Fourier transform the resulting volume- 
weighted velocity field and calculate the diver- 
gence in Fourier space. 

This procedure is not ideal for several reasons. 
First, there is the choice of the interpolation scheme: 
one would like to choose a low-order interpola- 
tion scheme because it docs not smooth out fields 
too much (so sharpening only affects the highest- fc 
modes), on the other hand, a low-order interpolation 
scheme gives rise to many grid points with zero den- 
sity and momentum, thus the velocity field cannot 
be defined there. Using a high-order interpolation 
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scheme bypasses this problem, but leads to some grid 
points with negative density after step ii), due to 
the fact that sharpening can be numerically unsta- 
ble in voids. A more practical procedure is to divide 
the interpolated momentum by the interpolated den- 
sity (both of which have been similarly affected by 
the interpolation window), Fourier transform that, 
and without applying any corrections (since inter- 
polation corrections in numerator and denominator 
should roughly cancel), calculate the divergence of 
the velocity field. This procedure is safe to the ex- 
tent that gives results independent of the interpo- 
lation scheme. We have tried second (CIC), third 
(TSC) and fourth-order interpolation schemes with 



similar results: at large scales k < 0.3 h Mpc^^ the 
different procedures give the same power spectrum, 
for smaller scales the results obtained start to de- 
pend on the particular scheme used. It would be 
interesting to try using Delaunay or Voronoi tesse- 
lation techniques [50] to see whether this can be im- 
proved for smaller scales, but our procedure is sim- 
pler and works well at large scales. 

We now present the calculation of the density 

and velocity divergence aiito and cross power spec- 
tra using one-loop PT. In linear PT, by definition 
Pss{k) = Pseik) = Pse{k) = P{k). Non-linear cor- 
rections break this degeneracy, giving 



Pssik) = P{k) + 2 j [F2{p, q)fP{p)P{q)d\ + 6P{k) j F^ik, q)P{q)d\ (63) 
Pee{k) = P{k)+2 j [G2{p,q)fP{p)P{q)d\ + &P{k) j G^{k,q)P{q)d^q (64) 
Pse{k) = P{k) + 2 / F2{p, q)G2{p, q)P{p)P{q)d^q + 3P{k) [ [Fs{k, q) + Gs{k, q)]P{q)d'q, (65) 



where p = k — q. The first term of non- linear correc- 
tions describes the contribution to the power spec- 
trum at k due coupling between modes q and p, 
whereas the second term corresponds instead to cor- 

I 



F2{k,q) = 
G2{k,q) = 



where 1^2 = 34/21 and IJ.2 = 26/21 represent the 
second-order evolution in the spherical collapse dy- 
namics. The other two terms in these kernels have a 
different physical origin: the middle term is due to 
the non-linear transformation from following mass 
elements to studying the dynamics at fixed spa- 



rections to the linear growth factor that depend on 
k. The kernels F2 and G2 can be written as {k = k/k 
and similarly for q) 



{qiQj - 3<^y) (66) 

lSij){q,qj-yij) (67) 

I 

tial position ( "Lagrangian to Eulerian space" map- 
ping), the last term represents the effect of the tidal 
gravitational fields, since (kikj — ^Sij)S{k) is the 
Fourier representation of the tidal gravitational field 
ViVj$(a;) - |(5ijV^$(a;), where $ is the gravita- 
tional potential. The important thing to notice here 
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is that velocity fields are more sensitive to tidal 
fields, the coefficient of the last term in G2 is twice 
that in i^2, and consequently they evolve less by 
spherical collapse (that's why ^2 is smaller than V2 
to exactly compensate) and therefore do not grow 
as fast due to non-linear effects, in fact, we shall see 
that non-linear growth is significantly smaller than 
linear at the scales we are interested in. 

The F3 and terms can be analyzed in a simi- 
lar way, but they are more complicated, instead we 
just write down their expression after the angular 
integration over k ■ q has been done. 



6k^ - 79k^q^ + 50Pq^ - 2lq^ 
{q^ -Pf{7q^ + 2P) 



In 



k + q 



k — q 



(68) 



G3(fc,g) 



21fc2g4 
(q2-fc2)3(q2^2fc2) 



14A:3g5 



In 



k + q 



k — q 



(69) 



These terms are negative and the magnitude of 
G3 is larger than F3. This leads to an overall sup- 
pression of Pee{k) compared to linear theory. Fig- 
ure 6 shows the results of these calculations (solid 
lines) and measurements in numerical simulations 
(symbols), expressed as ratios to the linear power 
spectrum Piin{k). One- loop PT for Pss{k) per- 
forms significantly worse than for spectra with no 
baryonic wiggles, though it does seem to track the 
variations seen in the simulations, about 10% for 
k < 0.2 h Mpc~^, at least in a qualitative sense. 
The situation is significantly better for Pse{k), but 
this good agreement appears to be to some extent 
an accident, a cancellation between too large correc- 
tions for Pssik) and Pee{k) with opposite signs. 

These results can be understood qualitatively and 
to some extent quantitatively as well by consider- 
ing one-loop PT for scale-free initial conditions with 
Pii„(fc) = fc". In this case [51], 



Pxy{k) = Piin{k) l + axy{n)[ — 



k \"+3 



(70) 
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FIG. 6: Non-linear corrections to the density- density 
(top), density- velocity (middle) and velocity- velocity 
(bottom) power spectra as a lunction ol scale. The sym- 
bols denote measurements in the VLS dark matter simu- 
lations, solid lines denote one-loop perturbation theory. 



where fcni is the nonlinear scale defined form the lin- 
ear power spectrum, and x and y denote any of S, 9. 
The functions a are decreasing functions of n, pos- 
itive for n sufficiently negative and negative for n 
sufficiently positive; the sign of a describes whether 
the nonlinear growth is faster or slower than in lin- 
ear theory. Corrections to Pgs (Pee) are positive 
for n < —1.4 (n < —1.9) and negative otherwise 
(see Fig. 12 in [52] for plots of ass and agg). For 
CDM models close to the nonlinear scale at e.g. 
k — 0.1 h Mpc~^, the effective spectral index is 
'^off ~ —1.35, which being close to the critical in- 
dex for 5 where corrections to Pss{k) vanish, leads 
to a small negative correction to Pss- On the other 
hand, the situation is very different for 9 that has a 
critical index of —1.9, thus the large negative correc- 
tions to Pge [67]. This has a significant impact on 
the large-scale redshift-space power spectrum. For 
more discussion of nonlinear corrections along these 
fines see e.g. [51-55]. 
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FIG. 7: Contours of the redshift-space power spectrum 
at z = 0. The sohd hues correspond to the N-body sim- 
ulation resuhs, dashed hues denote the Kaiser formula, 
and dotted lines show the simplified ansatz of Eq. (71). 



VI. THE REDSHIFT-SPACE POWER 
SPECTRUM 



not correct even at large scales, as the pairwise veloc- 
ity PDF is significantly non-Gaussian at all scales. 
Going beyond this however requires and evaluation 
of the pairwise PDF in the non-Gaussian case, which 
is addressed in paper II. We try to compensate for 
this by keeping a constant Gaussian velocity dis- 
persion suppression factor given by linear dynam- 
ics; this is an improvement over the incorrect scale 
dependence in linear dynamics and partially mimics 
the effect of non-Gaussian terms. But it is clearly 
an oversimplification. Note that although at first 
sight Eq. (71) looks similar to the phenomenological 
model of [6], it is in fact rather different: we do not 
fit for a velocity dispersion factor, but rather ct^ is 
predicted by linear dynamics and depends on as and 
the shape of the power spectrum; also, we incorpo- 
rate the difference in evolution between density and 
velocity fields at large scales, as seen in Fig. 6. 

Figure 7 shows the results of such an exercise, 
compared to the numerical simulation results (sym- 
bols) and to the Kaiser formula (dashed) . Although 
the improvement is significant there are still some 
deviations, which is not surprising given our ap- 
proximate treatment. In particular, Eq. (71) does 
not give enough suppression at intermediate angles. 
The suppression of power at /i = 1 works reasonably 
well, and it is due to the velocity dispersion and the 
nonlinear corrections to Pse and Pge; for example at 
k = 0.1 h Mpc""'^ each effects suppresses power at 
fj, — 1 hy the same amount, about 10% each. 



A. A simple model 

We now put together the results discussed above 
to see how well one can match the large-scale 
redshift-space power spectrum with the information 
we have so far, without resorting to an evaluation of 
the PDF of pairwise velocities in the non-Gaussian 
case. Specifically, we use the following ansatz, 

Ps{k) - (^Pss{k) + 2f^i^Pse{k) + fpi^Pee{k)) 
X eM-fk^al), (71) 

where Pss, Pse and Pgg refer to the non-linear spec- 
tra, see Fig. 6. We only include velocity dispersion 
effects using the large-scale limit in the Gaussian 
case; as discussed at the end of section III A this is 



B. Recovering the real-space power spectrum 

An important question is to what extent one can 
recover the real-space power from measurements of 
the redshift-space power spectrum. Attempts to 
do this fall into two different approaches: one is 
to measure the projected correlation function ^p, 
Eq. (16), by integrating the redshift-space correla- 
tion function along the line of sight [56-58] , the other 
is to try to measure the power for modes perpen- 
dicular to the line of sight cither by smoothly ap- 
proaching /.t = [59] or at large scales by using the 
Kaiser formula to go from multipoles to the real- 
space power [17, 60]. Here we explore the conditions 
of validity of the latter. 

First, it is important to note that even if one is 
interested in just the shape of the real-space power 
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k [h Mpc-'] 

FIG. 8: Top panel: Ratio of the monopole (solid) and 
quadrupole (dashed) to the predictions of the Kaiser for- 
mula, for redshit-space power given by Eq. (71). Symbols 
show the same quantities in the numerical simulations. 
Bottom panel: recovery of the real-space power spectrum 
from redshift-space multipoles according to Eq. (73) for 
the model in Eq. (71) (solid) and N-body simulations 
(symbols). 



spectrum (and not its amplitude), it should be clear 
from the results presented so far that there is no 
reason to expect that the monopole of the redshift- 
space power should have the same shape as the real- 
space power, at least not to the accuracy of current 
large surveys such as 2DFGRS and SDSS. The top 
panel in Fig. 8 illustrates this point for the model of 
Eq. (71) and N-body simulations, where we com- 
pare their monopole and quadrupole to those in 
the Kaiser limit, = P{k){l + ffi^f with P{k) 
the nonlinear real-space power spectrum. Note that 
even at k = 0.1 h Mpc^^ the monopole is suppressed 
by about 10% and the quadrupole by 35 — 40%. 
From Fig. 8 we can see again that our model un- 
derestimates the suppression when compared to nu- 
merical simulations. In principle the situation for 
galaxies could be different than shown in Fig. 8, but 



for close to unbiased galaxies there is no reason why 
it should be drastically different than for the model 
in Eq. (71), given that we only include velocity dis- 
persion due to large-scale flows and nonlinear cor- 
rections to 9 should not be affected by biasing, be- 
ing a volume weighted velocity [see discussion after 
Eq. (31)]. We stress that ignoring the suppression 
of power at large scales can contribute to systematic 
effects in the determination of shape parameter, the 
spectral tilt and running of the spectral index, or 
constraints on the neutrino mass. 

The good news is that an "inverse use" of the 
Kaiser formula has a larger regime of validity than 
one might expect based on the results discussed so 
far. As long as we can approximate the redshift- 
space power spectrum with only £ = 0,2,4 multi- 
poles we can always write 

P,ik) = P{k)[l + 2A2{k)^i^ + Aiik)fi^] (72) 

with A2{k) and A4(fc) some arbitrary functions of 
k. One can think of these functions as scale depen- 
dent versions of / or /3 when bias is present, i.e. in 
the Kaiser limit Ai{k) = [A2(fc)]2 = p"^. The in- 
teresting piece of information is that recovering the 
real-space power spectrum from the redshift-space 
multipoles in the case of arbitrary A2 and A4 is that 
is still given by the same linear combination as in 
the Kaiser limit, 

Pik)=Po{k)-^P2ik) + ^Piik), (73) 

even for arbirtrary A2{k) and A4{k), since Eq. (73) 
only uses orthogonality of Legendre polynomials up 
to ^ = 4. Equation (73) is thus far more general 
than assuming the Kaiser limit, basically the linear 
combination at each scale is done using the effective 
value of /? at that scale implied by A2 and A4. The 
reason why this is useful is that higher than i = A 
multipoles are generated only for k > 0.2 h Mpc~^ 
since they are suppressed by higher powers of k^ in 
the large-scale expansion, see Eq. (52). An exam- 
ple of the effectiveness of using Eq. (73) is given in 
the bottom panel of Fig. 8, where we use it to re- 
construct the real power in the case of the model in 
Eq. (71) and for the N-body measurements, which 
do not have the form of Eq. (72), since the exponen- 
tial generates all multipoles higher than £ — 4 with 
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roughly equal amplitude in the high-fc limit and even 
more so for the simulation. Nonetheless, the recov- 
ery of the real-space power is quite successful for 
A; < 0.2 — 0.3 h Mpc~^, a bit worse for the simula- 
tion that has a larger velocity dispersion everywhere 
compared to the Gaussian value (see top panel in 
Fig. 3). 

The approach of using Eq. (73) to recover the 
real-space power spectrum was implemented already 
in [17, 60]. Of course the use Eq. (73) can be ex- 
tended to include higher multipoles if possible, this 
will increase the regime of validity of the reconstruc- 
tion. Note however that the nice property of recov- 
ering Pss does not extend to Pse and Pgg, e.g. us- 
ing the same idea one obtains jP2{k) — ^P4{k) = 
A2{k)P{k), which cannot be interpreted as Pso with 
the same degree of accuracy due to the effects of 
velocity dispersion. 



VII. CONCLUSIONS 

We have derived the exact relationship between 
two-point statistics in real and redshift space in 
terms of the statistics of pairwise velocities. This 
is given by Eq. (12) for the two-point correlation 
function in terms of the pairwise velocity PDF, and 
by Eq.(13) for the power spectrum in terms of the 
pairwise velocity generating function. These results 
include all non-linearities in the dynamics and the 
real-to-redshift space mapping, the only approxima- 
tion made is that distortions are plane-parallel. The 
radial distortion case can be dca'ived by similar rea- 
soning to that in section II A. Higher-order correla- 
tion functions in redshift space can also be studied 
along the same lines. 

We also showed that, 

i) The pairwise velocity PDF is strongly non- 
Gaussian at all scales (Figs. 1 and 2). The 
failure to reach Ganssianity at large scales is 
related to the fact that difference of velocities 
between members of a pair are always sensitive 
to modes whose wavelength is smaller than the 
distance of separation. 

ii) The often used dispersion model, Eq. (1), gives 
rise to an unphysical distribution of pairwise 
velocities (see bottom left panel in Fig. 1). 



iii) It is impossible in general to derive the PDF 
of pairwise velocities from measurements of 
redshift-space clustering. Methods that claim 
to do this obtain instead something else, whose 
properties we derive, see Eqs. (25-27). 

iv) The exact result for the redshift-space corre- 
lation function of a random Gaussian field is 
significantly different from the Kaiser formula 
at large scales for pairs parallel to the line of 
sight (Fig. 4). 

v) The large-scale limit of the redshift-space 
power spectrum in the general case differs 
from the Kaiser formula by terms that depend 
on Gaussian and non-Gaussian contributions 
to the velocity dispersion of large-scale flows, 
Eq. (55). 

vi) There are significant nonlinear corrections to 
the evolution of velocity fields at scales much 
larger than the "nonlinear scale" (Fig. 6). 
These are due to the sensitivity of veloci- 
ties to tidal gravitational fields, which sup- 
press the growth relative to linear perturbation 
theory and have a significant impact on the 
redshift-space power spectrum. These correc- 
tions should be included when modeling large- 
scale velocity flows. 

vii) The monopole of the redshift-space power 
spectrum does not provide a good measure of 
the shape of the real-space power spectrum 
(top panel in Fig. 8). Ignoring this can lead 
to systematic effects in the determination of 
the spectral tilt, running of the spectral index, 
and limits on the neutrino mass. 

viii) The real-space power spectrum can be recov- 
ered at large scales by the standard procedure 
based on the orthogonality of multipoles (bot- 
tom panel in Fig. 8). 

We have ignored the problem of galaxy biasing, 
although linear bias is of course trivial to intro- 
duce; it is interesting to note in this regard that 
non-Gaussian terms give a different dependence on 
cosmological parameters that can be used to break 
degeneracies. Nonlinearities in the bias between 
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galaxies and dark matter can lead to nontrivial be- 
havior, this will be explored elsewhere. An impor- 
tant gap that remains is the derivation of the large- 
scale limit of the PDF of pairwise velocities, this is 
a difficult problem that will be addressed in paper 
II. This should allow a more physical modeling of 
the redshift-space power spectrum along the lines of 
section VIA, where we assumed (incorrectly) that 
Gaussianity holds at large scales. The usefulness 
of such a model is that it allows for the correlation 
that exists between the squashing and dispersion ef- 
fects, which so far have been taken as independent in 
the modeling of redshift distortions, such as Eq. (1). 
The correlation between both effects depends on 
and the shape and normalization of the power spec- 
trum. Using this information is essential to extract 
the full information encoded in the anisotropy of the 
redshift-space power spectrum, which on physical 
grounds must be poorly described by just two inde- 
pendent numbers such as f3 and an effective velocity 
dispersion ap. 
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